Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 9471 |
| Missing cells | 37353 |
| Missing cells (%) | 23.2% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.2 MiB |
| Average record size in memory | 136.0 B |
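The derived figures in this table can be checked from the raw counts. The sketch below uses only numbers shown above; the 8 bytes per cell is an assumption, chosen because it matches the reported 136.0 B record size for 17 columns.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 17
n_observations = 9471
missing_cells = 37353

# Every observation has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: each cell occupies 8 bytes (a float64 value or an object pointer).
record_bytes = n_variables * 8
total_mib = record_bytes * n_observations / 2**20

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"record size: {record_bytes} B, total size: {total_mib:.1f} MiB")
```

Under that assumption the rounded results agree with the 23.2%, 136.0 B and 1.2 MiB reported in the table.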
Variable types
| Categorical | 2 |
|---|---|
| Numeric | 13 |
| Unsupported | 2 |
Date has a high cardinality: 391 distinct values | High cardinality |
CO(GT) is highly correlated with PT08.S1(CO) and 8 other fields | High correlation |
PT08.S1(CO) is highly correlated with CO(GT) and 8 other fields | High correlation |
NMHC(GT) is highly correlated with CO(GT) and 8 other fields | High correlation |
C6H6(GT) is highly correlated with CO(GT) and 8 other fields | High correlation |
PT08.S2(NMHC) is highly correlated with CO(GT) and 8 other fields | High correlation |
NOx(GT) is highly correlated with CO(GT) and 7 other fields | High correlation |
PT08.S3(NOx) is highly correlated with CO(GT) and 8 other fields | High correlation |
NO2(GT) is highly correlated with CO(GT) and 7 other fields | High correlation |
PT08.S4(NO2) is highly correlated with CO(GT) and 8 other fields | High correlation |
PT08.S5(O3) is highly correlated with CO(GT) and 8 other fields | High correlation |
T is highly correlated with PT08.S4(NO2) and 2 other fields | High correlation |
RH is highly correlated with T | High correlation |
AH is highly correlated with PT08.S4(NO2) and 1 other fields | High correlation |
CO(GT) is highly correlated with PT08.S1(CO) and 8 other fields | High correlation |
PT08.S1(CO) is highly correlated with CO(GT) and 8 other fields | High correlation |
NMHC(GT) is highly correlated with CO(GT) and 9 other fields | High correlation |
C6H6(GT) is highly correlated with CO(GT) and 8 other fields | High correlation |
PT08.S2(NMHC) is highly correlated with CO(GT) and 8 other fields | High correlation |
NOx(GT) is highly correlated with CO(GT) and 7 other fields | High correlation |
PT08.S3(NOx) is highly correlated with CO(GT) and 8 other fields | High correlation |
NO2(GT) is highly correlated with CO(GT) and 7 other fields | High correlation |
PT08.S4(NO2) is highly correlated with CO(GT) and 8 other fields | High correlation |
PT08.S5(O3) is highly correlated with CO(GT) and 8 other fields | High correlation |
T is highly correlated with NMHC(GT) and 3 other fields | High correlation |
RH is highly correlated with T | High correlation |
AH is highly correlated with PT08.S4(NO2) and 1 other fields | High correlation |
CO(GT) is highly correlated with PT08.S1(CO) and 7 other fields | High correlation |
PT08.S1(CO) is highly correlated with CO(GT) and 6 other fields | High correlation |
NMHC(GT) is highly correlated with CO(GT) and 8 other fields | High correlation |
C6H6(GT) is highly correlated with CO(GT) and 7 other fields | High correlation |
PT08.S2(NMHC) is highly correlated with CO(GT) and 7 other fields | High correlation |
NOx(GT) is highly correlated with CO(GT) and 7 other fields | High correlation |
PT08.S3(NOx) is highly correlated with CO(GT) and 7 other fields | High correlation |
NO2(GT) is highly correlated with CO(GT) and 4 other fields | High correlation |
PT08.S4(NO2) is highly correlated with NMHC(GT) and 2 other fields | High correlation |
PT08.S5(O3) is highly correlated with CO(GT) and 7 other fields | High correlation |
T is highly correlated with AH | High correlation |
AH is highly correlated with T | High correlation |
Time is highly correlated with C6H6(GT) and 5 other fields | High correlation |
PT08.S3(NOx) is highly correlated with PT08.S5(O3) and 8 other fields | High correlation |
PT08.S5(O3) is highly correlated with PT08.S3(NOx) and 8 other fields | High correlation |
NOx(GT) is highly correlated with PT08.S3(NOx) and 7 other fields | High correlation |
C6H6(GT) is highly correlated with Time and 9 other fields | High correlation |
RH is highly correlated with T | High correlation |
PT08.S4(NO2) is highly correlated with PT08.S3(NOx) and 8 other fields | High correlation |
CO(GT) is highly correlated with Time and 9 other fields | High correlation |
PT08.S1(CO) is highly correlated with Time and 9 other fields | High correlation |
T is highly correlated with RH and 2 other fields | High correlation |
AH is highly correlated with PT08.S4(NO2) and 1 other fields | High correlation |
PT08.S2(NMHC) is highly correlated with Time and 9 other fields | High correlation |
NO2(GT) is highly correlated with Time and 8 other fields | High correlation |
NMHC(GT) is highly correlated with Time and 9 other fields | High correlation |
Date has 114 (1.2%) missing values | Missing |
Time has 114 (1.2%) missing values | Missing |
CO(GT) has 1797 (19.0%) missing values | Missing |
PT08.S1(CO) has 480 (5.1%) missing values | Missing |
NMHC(GT) has 8557 (90.3%) missing values | Missing |
C6H6(GT) has 480 (5.1%) missing values | Missing |
PT08.S2(NMHC) has 480 (5.1%) missing values | Missing |
NOx(GT) has 1753 (18.5%) missing values | Missing |
PT08.S3(NOx) has 480 (5.1%) missing values | Missing |
NO2(GT) has 1756 (18.5%) missing values | Missing |
PT08.S4(NO2) has 480 (5.1%) missing values | Missing |
PT08.S5(O3) has 480 (5.1%) missing values | Missing |
T has 480 (5.1%) missing values | Missing |
RH has 480 (5.1%) missing values | Missing |
AH has 480 (5.1%) missing values | Missing |
Unnamed: 15 has 9471 (100.0%) missing values | Missing |
Unnamed: 16 has 9471 (100.0%) missing values | Missing |
Date is uniformly distributed | Uniform |
Time is uniformly distributed | Uniform |
Unnamed: 15 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 16 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
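The two `Unnamed` columns are 100% missing, so a natural first cleaning step is to drop every column that holds no values at all. The following is a minimal sketch in plain Python over a list of row dictionaries. The helper name `drop_empty_columns` and the tiny `rows` sample (including the value 2.6) are hypothetical and only illustrate the idea.

```python
from typing import Any


def drop_empty_columns(rows: list[dict[str, Any]]) -> list[dict[str, Any]]:
    """Return the rows without the columns whose value is None in every row."""
    if not rows:
        return []
    # Keep a column if at least one row has a non-missing value for it.
    keep = [
        col for col in rows[0]
        if any(row.get(col) is not None for row in rows)
    ]
    return [{col: row.get(col) for col in keep} for row in rows]


# Hypothetical sample: None stands for a missing cell.
rows = [
    {"Date": "10/03/2004", "CO(GT)": 2.6, "Unnamed: 15": None, "Unnamed: 16": None},
    {"Date": "10/03/2004", "CO(GT)": None, "Unnamed: 15": None, "Unnamed: 16": None},
]

cleaned = drop_empty_columns(rows)
print(list(cleaned[0]))
```

Columns that are only partly missing, such as `CO(GT)` here, are kept. With a pandas DataFrame, `DataFrame.dropna(axis=1, how="all")` performs the same step.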
Reproduction
| Analysis started | 2021-07-04 21:53:51.268218 |
|---|---|
| Analysis finished | 2021-07-04 21:54:28.552104 |
| Duration | 37.28 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
Date
| Distinct | 391 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 114 |
| Missing (%) | 1.2% |
| Memory size | 74.1 KiB |
| 03/03/2005 | 24 |
|---|---|
| 07/02/2005 | 24 |
| 18/05/2004 | 24 |
| 20/12/2004 | 24 |
| 19/09/2004 | 24 |
| Other values (386) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 93570 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 10/03/2004 |
|---|---|
| 2nd row | 10/03/2004 |
| 3rd row | 10/03/2004 |
| 4th row | 10/03/2004 |
| 5th row | 10/03/2004 |
Common Values
| Value | Count | Frequency (%) |
| 03/03/2005 | 24 | 0.3% |
| 07/02/2005 | 24 | 0.3% |
| 18/05/2004 | 24 | 0.3% |
| 20/12/2004 | 24 | 0.3% |
| 19/09/2004 | 24 | 0.3% |
| 17/08/2004 | 24 | 0.3% |
| 03/06/2004 | 24 | 0.3% |
| 31/07/2004 | 24 | 0.3% |
| 24/05/2004 | 24 | 0.3% |
| 11/01/2005 | 24 | 0.3% |
| Other values (381) | 9117 | |
| (Missing) | 114 | 1.2% |
Length
| Value | Count | Frequency (%) |
| 25/07/2004 | 24 | 0.3% |
| 15/02/2005 | 24 | 0.3% |
| 23/11/2004 | 24 | 0.3% |
| 07/02/2005 | 24 | 0.3% |
| 18/05/2004 | 24 | 0.3% |
| 20/12/2004 | 24 | 0.3% |
| 19/09/2004 | 24 | 0.3% |
| 17/08/2004 | 24 | 0.3% |
| 03/06/2004 | 24 | 0.3% |
| 31/07/2004 | 24 | 0.3% |
| Other values (381) | 9117 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 30180 | |
| / | 18714 | |
| 2 | 14805 | |
| 4 | 8844 | 9.5% |
| 1 | 7902 | 8.4% |
| 5 | 3903 | 4.2% |
| 3 | 2670 | 2.9% |
| 7 | 1656 | 1.8% |
| 8 | 1656 | 1.8% |
| 6 | 1632 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 74856 | |
| Other Punctuation | 18714 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 30180 | |
| 2 | 14805 | |
| 4 | 8844 | 11.8% |
| 1 | 7902 | 10.6% |
| 5 | 3903 | 5.2% |
| 3 | 2670 | 3.6% |
| 7 | 1656 | 2.2% |
| 8 | 1656 | 2.2% |
| 6 | 1632 | 2.2% |
| 9 | 1608 | 2.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 18714 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 93570 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 30180 | |
| / | 18714 | |
| 2 | 14805 | |
| 4 | 8844 | 9.5% |
| 1 | 7902 | 8.4% |
| 5 | 3903 | 4.2% |
| 3 | 2670 | 2.9% |
| 7 | 1656 | 1.8% |
| 8 | 1656 | 1.8% |
| 6 | 1632 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 93570 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 30180 | |
| / | 18714 | |
| 2 | 14805 | |
| 4 | 8844 | 9.5% |
| 1 | 7902 | 8.4% |
| 5 | 3903 | 4.2% |
| 3 | 2670 | 2.9% |
| 7 | 1656 | 1.8% |
| 8 | 1656 | 1.8% |
| 6 | 1632 | 1.7% |
Time
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 114 |
| Missing (%) | 1.2% |
| Memory size | 74.1 KiB |
| 14.00.00 | 390 |
|---|---|
| 13.00.00 | 390 |
| 20.00.00 | 390 |
| 19.00.00 | 390 |
| 18.00.00 | 390 |
| Other values (19) |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Characters and Unicode
| Total characters | 74856 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 18.00.00 |
|---|---|
| 2nd row | 19.00.00 |
| 3rd row | 20.00.00 |
| 4th row | 21.00.00 |
| 5th row | 22.00.00 |
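The sample rows show dates such as `10/03/2004` and times such as `18.00.00`, both stored as strings, which is why the report treats the two columns as categorical. One way to turn them into a single timestamp is sketched below. The day-first reading is an assumption supported by values like `18/05/2004` elsewhere in the report, and the helper name `parse_timestamp` is hypothetical.

```python
from datetime import datetime


def parse_timestamp(date_str: str, time_str: str) -> datetime:
    """Combine a DD/MM/YYYY date string and an HH.MM.SS time string."""
    return datetime.strptime(f"{date_str} {time_str}", "%d/%m/%Y %H.%M.%S")


# First sample row of the Date and Time columns.
ts = parse_timestamp("10/03/2004", "18.00.00")
print(ts.isoformat())
```

Parsing the two columns this way would let the hourly readings be treated as a time series rather than as 391 and 24 unrelated categories.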
Common Values
| Value | Count | Frequency (%) |
| 14.00.00 | 390 | 4.1% |
| 13.00.00 | 390 | 4.1% |
| 20.00.00 | 390 | 4.1% |
| 19.00.00 | 390 | 4.1% |
| 18.00.00 | 390 | 4.1% |
| 05.00.00 | 390 | 4.1% |
| 04.00.00 | 390 | 4.1% |
| 09.00.00 | 390 | 4.1% |
| 10.00.00 | 390 | 4.1% |
| 12.00.00 | 390 | 4.1% |
| Other values (14) | 5457 |
Length
| Value | Count | Frequency (%) |
| 13.00.00 | 390 | 4.2% |
| 23.00.00 | 390 | 4.2% |
| 20.00.00 | 390 | 4.2% |
| 19.00.00 | 390 | 4.2% |
| 18.00.00 | 390 | 4.2% |
| 05.00.00 | 390 | 4.2% |
| 04.00.00 | 390 | 4.2% |
| 09.00.00 | 390 | 4.2% |
| 10.00.00 | 390 | 4.2% |
| 12.00.00 | 390 | 4.2% |
| Other values (14) | 5457 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 42498 | |
| . | 18714 | |
| 1 | 5067 | 6.8% |
| 2 | 2730 | 3.6% |
| 3 | 1170 | 1.6% |
| 8 | 780 | 1.0% |
| 9 | 780 | 1.0% |
| 4 | 780 | 1.0% |
| 5 | 779 | 1.0% |
| 6 | 779 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 56142 | |
| Other Punctuation | 18714 | 25.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 42498 | |
| 1 | 5067 | 9.0% |
| 2 | 2730 | 4.9% |
| 3 | 1170 | 2.1% |
| 8 | 780 | 1.4% |
| 9 | 780 | 1.4% |
| 4 | 780 | 1.4% |
| 5 | 779 | 1.4% |
| 6 | 779 | 1.4% |
| 7 | 779 | 1.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 18714 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 74856 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 42498 | |
| . | 18714 | |
| 1 | 5067 | 6.8% |
| 2 | 2730 | 3.6% |
| 3 | 1170 | 1.6% |
| 8 | 780 | 1.0% |
| 9 | 780 | 1.0% |
| 4 | 780 | 1.0% |
| 5 | 779 | 1.0% |
| 6 | 779 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 74856 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 42498 | |
| . | 18714 | |
| 1 | 5067 | 6.8% |
| 2 | 2730 | 3.6% |
| 3 | 1170 | 1.6% |
| 8 | 780 | 1.0% |
| 9 | 780 | 1.0% |
| 4 | 780 | 1.0% |
| 5 | 779 | 1.0% |
| 6 | 779 | 1.0% |
CO(GT)
| Distinct | 96 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 1797 |
| Missing (%) | 19.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.152749544 |
| Minimum | 0.1 |
|---|---|
| Maximum | 11.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 74.1 KiB |
Quantile statistics
| Minimum | 0.1 |
|---|---|
| 5-th percentile | 0.5 |
| Q1 | 1.1 |
| median | 1.8 |
| Q3 | 2.9 |
| 95-th percentile | 4.9 |
| Maximum | 11.9 |
| Range | 11.8 |
| Interquartile range (IQR) | 1.8 |
Descriptive statistics
| Standard deviation | 1.453252036 |
|---|---|
| Coefficient of variation (CV) | 0.675067864 |
| Kurtosis | 2.667779368 |
| Mean | 2.152749544 |
| Median Absolute Deviation (MAD) | 0.8 |
| Skewness | 1.369752778 |
| Sum | 16520.2 |
| Variance | 2.111941481 |
| Monotonicity | Not monotonic |
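Several of the statistics above are simple functions of the others, which gives a quick way to check the tables for consistency. The sketch below recomputes them from the reported values for this variable (the one with 1797 missing values). The non-missing count is taken as the number of observations minus the missing count.

```python
# Values copied from the quantile and descriptive statistics tables above.
minimum, maximum = 0.1, 11.9
q1, q3 = 1.1, 2.9
mean = 2.152749544
std = 1.453252036
total = 16520.2          # "Sum" row
n = 9471 - 1797          # observations minus missing values

value_range = maximum - minimum   # "Range"
iqr = q3 - q1                     # "Interquartile range (IQR)"
cv = std / mean                   # "Coefficient of variation (CV)"
variance = std ** 2               # "Variance"
mean_from_sum = total / n         # should agree with "Mean"

print(f"range={value_range:.1f} iqr={iqr:.1f} cv={cv:.6f}")
print(f"variance={variance:.6f} mean={mean_from_sum:.4f}")
```

The recomputed values agree with the reported rows up to rounding of the inputs.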
| Value | Count | Frequency (%) |
| 1 | 305 | 3.2% |
| 1.4 | 279 | 2.9% |
| 1.6 | 275 | 2.9% |
| 1.5 | 273 | 2.9% |
| 1.1 | 262 | 2.8% |
| 0.7 | 260 | 2.7% |
| 1.7 | 258 | 2.7% |
| 1.3 | 253 | 2.7% |
| 0.8 | 251 | 2.7% |
| 0.9 | 248 | 2.6% |
| Other values (86) | 5010 | |
| (Missing) | 1797 | 19.0% |
| Value | Count | Frequency (%) |
| 0.1 | 33 | 0.3% |
| 0.2 | 45 | 0.5% |
| 0.3 | 98 | 1.0% |
| 0.4 | 160 | |
| 0.5 | 217 | |
| 0.6 | 244 | |
| 0.7 | 260 | |
| 0.8 | 251 | |
| 0.9 | 248 | |
| 1 | 305 |
| Value | Count | Frequency (%) |
| 11.9 | 1 | |
| 11.5 | 1 | |
| 10.2 | 2 | |
| 10.1 | 1 | |
| 9.9 | 1 | |
| 9.5 | 1 | |
| 9.4 | 1 | |
| 9.3 | 1 | |
| 9.2 | 1 | |
| 9.1 | 2 |
PT08.S1(CO)
Real number (ℝ≥0)
HIGH CORRELATION, MISSING
| Distinct | 1041 |
|---|---|
| Distinct (%) | 11.6% |
| Missing | 480 |
| Missing (%) | 5.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1099.833166 |
| Minimum | 647 |
|---|---|
| Maximum | 2040 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 74.1 KiB |
Quantile statistics
| Minimum | 647 |
|---|---|
| 5-th percentile | 810.5 |
| Q1 | 937 |
| median | 1063 |
| Q3 | 1231 |
| 95-th percentile | 1508 |
| Maximum | 2040 |
| Range | 1393 |
| Interquartile range (IQR) | 294 |
Descriptive statistics
| Standard deviation | 217.0800373 |
|---|---|
| Coefficient of variation (CV) | 0.1973754237 |
| Kurtosis | 0.3351286502 |
| Mean | 1099.833166 |
| Median Absolute Deviation (MAD) | 142 |
| Skewness | 0.7559073724 |
| Sum | 9888600 |
| Variance | 47123.74258 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 973 | 30 | 0.3% |
| 1100 | 28 | 0.3% |
| 938 | 26 | 0.3% |
| 988 | 26 | 0.3% |
| 925 | 26 | 0.3% |
| 969 | 26 | 0.3% |
| 987 | 25 | 0.3% |
| 984 | 25 | 0.3% |
| 1053 | 25 | 0.3% |
| 970 | 25 | 0.3% |
| Other values (1031) | 8729 | |
| (Missing) | 480 | 5.1% |
| Value | Count | Frequency (%) |
| 647 | 1 | < 0.1% |
| 649 | 1 | < 0.1% |
| 655 | 1 | < 0.1% |
| 667 | 3 | |
| 669 | 1 | < 0.1% |
| 676 | 1 | < 0.1% |
| 678 | 1 | < 0.1% |
| 679 | 1 | < 0.1% |
| 681 | 1 | < 0.1% |
| 683 | 2 |
| Value | Count | Frequency (%) |
| 2040 | 1 | |
| 2008 | 1 | |
| 1982 | 1 | |
| 1975 | 1 | |
| 1973 | 1 | |
| 1961 | 1 | |
| 1956 | 1 | |
| 1934 | 1 | |
| 1918 | 1 | |
| 1917 | 1 |
NMHC(GT)
Real number (ℝ≥0)
HIGH CORRELATION, MISSING
| Distinct | 429 |
|---|---|
| Distinct (%) | 46.9% |
| Missing | 8557 |
| Missing (%) | 90.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 218.8118162 |
| Minimum | 7 |
|---|---|
| Maximum | 1189 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 74.1 KiB |
Quantile statistics
| Minimum | 7 |
|---|---|
| 5-th percentile | 28.65 |
| Q1 | 67 |
| median | 150 |
| Q3 | 297 |
| 95-th percentile | 661.4 |
| Maximum | 1189 |
| Range | 1182 |
| Interquartile range (IQR) | 230 |
Descriptive statistics
| Standard deviation | 204.4599213 |
|---|---|
| Coefficient of variation (CV) | 0.9344098724 |
| Kurtosis | 2.270289034 |
| Mean | 218.8118162 |
| Median Absolute Deviation (MAD) | 94 |
| Skewness | 1.557017103 |
| Sum | 199994 |
| Variance | 41803.8594 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 66 | 14 | 0.1% |
| 29 | 9 | 0.1% |
| 40 | 9 | 0.1% |
| 88 | 8 | 0.1% |
| 93 | 8 | 0.1% |
| 84 | 7 | 0.1% |
| 55 | 7 | 0.1% |
| 95 | 7 | 0.1% |
| 57 | 7 | 0.1% |
| 60 | 7 | 0.1% |
| Other values (419) | 831 | 8.8% |
| (Missing) | 8557 |
| Value | Count | Frequency (%) |
| 7 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 11 | 1 | < 0.1% |
| 14 | 2 | |
| 16 | 1 | < 0.1% |
| 17 | 4 | |
| 18 | 2 | |
| 19 | 2 |
| Value | Count | Frequency (%) |
| 1189 | 1 | |
| 1129 | 1 | |
| 1084 | 1 | |
| 1042 | 1 | |
| 974 | 1 | |
| 926 | 1 | |
| 899 | 1 | |
| 880 | 1 | |
| 872 | 1 | |
| 840 | 1 |
C6H6(GT)
Real number (ℝ≥0)
HIGH CORRELATION, MISSING
| Distinct | 407 |
|---|---|
| Distinct (%) | 4.5% |
| Missing | 480 |
| Missing (%) | 5.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.08310533 |
| Minimum | 0.1 |
|---|---|
| Maximum | 63.7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 74.1 KiB |
Quantile statistics
| Minimum | 0.1 |
|---|---|
| 5-th percentile | 1.7 |
| Q1 | 4.4 |
| median | 8.2 |
| Q3 | 14 |
| 95-th percentile | 24.65 |
| Maximum | 63.7 |
| Range | 63.6 |
| Interquartile range (IQR) | 9.6 |
Descriptive statistics
| Standard deviation | 7.449819698 |
|---|---|
| Coefficient of variation (CV) | 0.7388418008 |
| Kurtosis | 2.488705886 |
| Mean | 10.08310533 |
| Median Absolute Deviation (MAD) | 4.4 |
| Skewness | 1.36153227 |
| Sum | 90657.2 |
| Variance | 55.49981354 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3.6 | 84 | 0.9% |
| 2.8 | 82 | 0.9% |
| 3.8 | 79 | 0.8% |
| 4 | 78 | 0.8% |
| 3.1 | 77 | 0.8% |
| 3 | 76 | 0.8% |
| 2.5 | 75 | 0.8% |
| 2.9 | 73 | 0.8% |
| 5.4 | 72 | 0.8% |
| 6 | 71 | 0.7% |
| Other values (397) | 8224 | |
| (Missing) | 480 | 5.1% |
| Value | Count | Frequency (%) |
| 0.1 | 2 | < 0.1% |
| 0.2 | 8 | 0.1% |
| 0.3 | 10 | 0.1% |
| 0.4 | 14 | |
| 0.5 | 20 | |
| 0.6 | 23 | |
| 0.7 | 31 | |
| 0.8 | 25 | |
| 0.9 | 25 | |
| 1 | 30 |
| Value | Count | Frequency (%) |
| 63.7 | 1 | |
| 52.1 | 1 | |
| 50.8 | 1 | |
| 50.7 | 1 | |
| 50.6 | 1 | |
| 49.5 | 1 | |
| 49.4 | 1 | |
| 48.2 | 1 | |
| 47.7 | 1 | |
| 47.5 | 1 |
PT08.S2(NMHC)
Real number (ℝ≥0)
HIGH CORRELATION, MISSING
| Distinct | 1245 |
|---|---|
| Distinct (%) | 13.8% |
| Missing | 480 |
| Missing (%) | 5.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 939.1533756 |
| Minimum | 383 |
|---|---|
| Maximum | 2214 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 74.1 KiB |
Quantile statistics
| Minimum | 383 |
|---|---|
| 5-th percentile | 562 |
| Q1 | 734.5 |
| median | 909 |
| Q3 | 1116 |
| 95-th percentile | 1420 |
| Maximum | 2214 |
| Range | 1831 |
| Interquartile range (IQR) | 381.5 |
Descriptive statistics
| Standard deviation | 266.8314286 |
|---|---|
| Coefficient of variation (CV) | 0.2841191179 |
| Kurtosis | 0.06324387318 |
| Mean | 939.1533756 |
| Median Absolute Deviation (MAD) | 188 |
| Skewness | 0.56156598 |
| Sum | 8443928 |
| Variance | 71199.01129 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 853 | 25 | 0.3% |
| 800 | 23 | 0.2% |
| 859 | 23 | 0.2% |
| 880 | 23 | 0.2% |
| 985 | 22 | 0.2% |
| 769 | 21 | 0.2% |
| 850 | 21 | 0.2% |
| 776 | 21 | 0.2% |
| 783 | 21 | 0.2% |
| 828 | 20 | 0.2% |
| Other values (1235) | 8771 | |
| (Missing) | 480 | 5.1% |
| Value | Count | Frequency (%) |
| 383 | 2 | |
| 387 | 1 | |
| 388 | 1 | |
| 390 | 2 | |
| 397 | 1 | |
| 399 | 1 | |
| 402 | 2 | |
| 407 | 2 | |
| 408 | 1 | |
| 409 | 1 |
| Value | Count | Frequency (%) |
| 2214 | 1 | |
| 2007 | 1 | |
| 1983 | 1 | |
| 1981 | 1 | |
| 1980 | 1 | |
| 1959 | 1 | |
| 1958 | 1 | |
| 1935 | 1 | |
| 1924 | 1 | |
| 1920 | 1 |
NOx(GT)
Real number (ℝ≥0)
HIGH CORRELATION, MISSING
| Distinct | 925 |
|---|---|
| Distinct (%) | 12.0% |
| Missing | 1753 |
| Missing (%) | 18.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 246.8967349 |
| Minimum | 2 |
|---|---|
| Maximum | 1479 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 74.1 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 38 |
| Q1 | 98 |
| median | 180 |
| Q3 | 326 |
| 95-th percentile | 693 |
| Maximum | 1479 |
| Range | 1477 |
| Interquartile range (IQR) | 228 |
Descriptive statistics
| Standard deviation | 212.9791681 |
|---|---|
| Coefficient of variation (CV) | 0.8626244822 |
| Kurtosis | 3.40213437 |
| Mean | 246.8967349 |
| Median Absolute Deviation (MAD) | 100 |
| Skewness | 1.715780799 |
| Sum | 1905549 |
| Variance | 45360.12605 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 89 | 41 | 0.4% |
| 65 | 37 | 0.4% |
| 93 | 36 | 0.4% |
| 41 | 36 | 0.4% |
| 122 | 36 | 0.4% |
| 95 | 35 | 0.4% |
| 180 | 35 | 0.4% |
| 132 | 35 | 0.4% |
| 120 | 34 | 0.4% |
| 51 | 34 | 0.4% |
| Other values (915) | 7359 | |
| (Missing) | 1753 | 18.5% |
| Value | Count | Frequency (%) |
| 2 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 10 | 3 | |
| 11 | 4 | |
| 12 | 4 | |
| 13 | 4 |
| Value | Count | Frequency (%) |
| 1479 | 1 | |
| 1389 | 2 | |
| 1369 | 1 | |
| 1358 | 1 | |
| 1345 | 1 | |
| 1310 | 1 | |
| 1301 | 1 | |
| 1290 | 1 | |
| 1253 | 1 | |
| 1247 | 1 |
PT08.S3(NOx)
Real number (ℝ≥0)
HIGH CORRELATION, MISSING
| Distinct | 1221 |
|---|---|
| Distinct (%) | 13.6% |
| Missing | 480 |
| Missing (%) | 5.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 835.4936047 |
| Minimum | 322 |
|---|---|
| Maximum | 2683 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 74.1 KiB |
Quantile statistics
| Minimum | 322 |
|---|---|
| 5-th percentile | 483 |
| Q1 | 658 |
| median | 806 |
| Q3 | 969.5 |
| 95-th percentile | 1291 |
| Maximum | 2683 |
| Range | 2361 |
| Interquartile range (IQR) | 311.5 |
Descriptive statistics
| Standard deviation | 256.81732 |
|---|---|
| Coefficient of variation (CV) | 0.3073839447 |
| Kurtosis | 2.677558895 |
| Mean | 835.4936047 |
| Median Absolute Deviation (MAD) | 155 |
| Skewness | 1.101729235 |
| Sum | 7511923 |
| Variance | 65955.13586 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 767 | 25 | 0.3% |
| 846 | 25 | 0.3% |
| 733 | 25 | 0.3% |
| 765 | 23 | 0.2% |
| 876 | 23 | 0.2% |
| 720 | 22 | 0.2% |
| 685 | 22 | 0.2% |
| 800 | 22 | 0.2% |
| 891 | 22 | 0.2% |
| 816 | 22 | 0.2% |
| Other values (1211) | 8760 | |
| (Missing) | 480 | 5.1% |
| Value | Count | Frequency (%) |
| 322 | 1 | |
| 325 | 2 | |
| 328 | 1 | |
| 330 | 2 | |
| 334 | 1 | |
| 335 | 1 | |
| 340 | 2 | |
| 341 | 1 | |
| 345 | 1 | |
| 346 | 1 |
| Value | Count | Frequency (%) |
| 2683 | 1 | |
| 2559 | 1 | |
| 2542 | 1 | |
| 2331 | 1 | |
| 2327 | 1 | |
| 2318 | 1 | |
| 2294 | 1 | |
| 2121 | 1 | |
| 2095 | 2 | |
| 2081 | 1 |
NO2(GT)
Real number (ℝ≥0)
HIGH CORRELATION, MISSING
| Distinct | 283 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 1756 |
| Missing (%) | 18.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 113.0912508 |
| Minimum | 2 |
|---|---|
| Maximum | 340 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 74.1 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 43 |
| Q1 | 78 |
| median | 109 |
| Q3 | 142 |
| 95-th percentile | 200.3 |
| Maximum | 340 |
| Range | 338 |
| Interquartile range (IQR) | 64 |
Descriptive statistics
| Standard deviation | 48.37010778 |
|---|---|
| Coefficient of variation (CV) | 0.4277086639 |
| Kurtosis | 0.4650321247 |
| Mean | 113.0912508 |
| Median Absolute Deviation (MAD) | 32 |
| Skewness | 0.6217143134 |
| Sum | 872499 |
| Variance | 2339.667327 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 97 | 78 | 0.8% |
| 119 | 77 | 0.8% |
| 117 | 77 | 0.8% |
| 101 | 75 | 0.8% |
| 95 | 75 | 0.8% |
| 114 | 75 | 0.8% |
| 110 | 74 | 0.8% |
| 115 | 73 | 0.8% |
| 116 | 72 | 0.8% |
| 107 | 72 | 0.8% |
| Other values (273) | 6967 | |
| (Missing) | 1756 | 18.5% |
| Value | Count | Frequency (%) |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 5 | 2 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 2 | < 0.1% |
| 11 | 2 | < 0.1% |
| 12 | 2 | < 0.1% |
| 13 | 1 | < 0.1% |
| 14 | 5 |
| Value | Count | Frequency (%) |
| 340 | 1 | |
| 333 | 1 | |
| 326 | 1 | |
| 322 | 1 | |
| 312 | 1 | |
| 310 | 1 | |
| 309 | 1 | |
| 306 | 1 | |
| 301 | 1 | |
| 296 | 1 |
PT08.S4(NO2)
Real number (ℝ≥0)
HIGH CORRELATION, MISSING
| Distinct | 1603 |
|---|---|
| Distinct (%) | 17.8% |
| Missing | 480 |
| Missing (%) | 5.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1456.264598 |
| Minimum | 551 |
|---|---|
| Maximum | 2775 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 74.1 KiB |
Quantile statistics
| Minimum | 551 |
|---|---|
| 5-th percentile | 883 |
| Q1 | 1227 |
| median | 1463 |
| Q3 | 1674 |
| 95-th percentile | 2029 |
| Maximum | 2775 |
| Range | 2224 |
| Interquartile range (IQR) | 447 |
Descriptive statistics
| Standard deviation | 346.2067935 |
|---|---|
| Coefficient of variation (CV) | 0.2377361875 |
| Kurtosis | 0.07801862433 |
| Mean | 1456.264598 |
| Median Absolute Deviation (MAD) | 221 |
| Skewness | 0.2053885254 |
| Sum | 13093275 |
| Variance | 119859.1439 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1488 | 24 | 0.3% |
| 1580 | 22 | 0.2% |
| 1539 | 21 | 0.2% |
| 1467 | 20 | 0.2% |
| 1638 | 19 | 0.2% |
| 1490 | 18 | 0.2% |
| 1418 | 18 | 0.2% |
| 1473 | 17 | 0.2% |
| 1321 | 17 | 0.2% |
| 1435 | 17 | 0.2% |
| Other values (1593) | 8798 | |
| (Missing) | 480 | 5.1% |
| Value | Count | Frequency (%) |
| 551 | 1 | |
| 559 | 1 | |
| 561 | 1 | |
| 579 | 1 | |
| 601 | 1 | |
| 602 | 1 | |
| 605 | 1 | |
| 621 | 1 | |
| 637 | 1 | |
| 640 | 1 |
| Value | Count | Frequency (%) |
| 2775 | 1 | |
| 2746 | 1 | |
| 2691 | 1 | |
| 2684 | 1 | |
| 2679 | 1 | |
| 2667 | 1 | |
| 2665 | 1 | |
| 2662 | 1 | |
| 2643 | 2 | |
| 2641 | 2 |
PT08.S5(O3)
Real number (ℝ≥0)
HIGH CORRELATION, MISSING
| Distinct | 1743 |
|---|---|
| Distinct (%) | 19.4% |
| Missing | 480 |
| Missing (%) | 5.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1022.906128 |
| Minimum | 221 |
|---|---|
| Maximum | 2523 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 74.1 KiB |
Quantile statistics
| Minimum | 221 |
|---|---|
| 5-th percentile | 461 |
| Q1 | 731.5 |
| median | 963 |
| Q3 | 1273.5 |
| 95-th percentile | 1761.5 |
| Maximum | 2523 |
| Range | 2302 |
| Interquartile range (IQR) | 542 |
Descriptive statistics
| Standard deviation | 398.4842877 |
|---|---|
| Coefficient of variation (CV) | 0.3895609545 |
| Kurtosis | 0.07861233923 |
| Mean | 1022.906128 |
| Median Absolute Deviation (MAD) | 261 |
| Skewness | 0.6278644976 |
| Sum | 9196949 |
| Variance | 158789.7276 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 825 | 20 | 0.2% |
| 836 | 20 | 0.2% |
| 826 | 19 | 0.2% |
| 926 | 18 | 0.2% |
| 777 | 17 | 0.2% |
| 799 | 17 | 0.2% |
| 923 | 16 | 0.2% |
| 905 | 16 | 0.2% |
| 891 | 16 | 0.2% |
| 949 | 16 | 0.2% |
| Other values (1733) | 8816 | |
| (Missing) | 480 | 5.1% |
| Value | Count | Frequency (%) |
| 221 | 1 | |
| 225 | 1 | |
| 227 | 1 | |
| 232 | 1 | |
| 252 | 1 | |
| 253 | 1 | |
| 257 | 1 | |
| 261 | 2 | |
| 262 | 1 | |
| 263 | 1 |
| Value | Count | Frequency (%) |
| 2523 | 1 | |
| 2522 | 1 | |
| 2519 | 1 | |
| 2515 | 1 | |
| 2494 | 1 | |
| 2480 | 1 | |
| 2475 | 1 | |
| 2465 | 1 | |
| 2452 | 1 | |
| 2434 | 1 |
T
| Distinct | 436 |
|---|---|
| Distinct (%) | 4.8% |
| Missing | 480 |
| Missing (%) | 5.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.31782894 |
| Minimum | -1.9 |
|---|---|
| Maximum | 44.6 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 13 |
| Negative (%) | 0.1% |
| Memory size | 74.1 KiB |
Quantile statistics
| Minimum | -1.9 |
|---|---|
| 5-th percentile | 4.6 |
| Q1 | 11.8 |
| median | 17.8 |
| Q3 | 24.4 |
| 95-th percentile | 34.5 |
| Maximum | 44.6 |
| Range | 46.5 |
| Interquartile range (IQR) | 12.6 |
Descriptive statistics
| Standard deviation | 8.832115732 |
|---|---|
| Coefficient of variation (CV) | 0.4821595267 |
| Kurtosis | -0.4562738166 |
| Mean | 18.31782894 |
| Median Absolute Deviation (MAD) | 6.3 |
| Skewness | 0.3093567921 |
| Sum | 164695.6 |
| Variance | 78.0062683 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20.8 | 57 | 0.6% |
| 21.3 | 54 | 0.6% |
| 20.2 | 51 | 0.5% |
| 13.8 | 51 | 0.5% |
| 12 | 49 | 0.5% |
| 15.6 | 49 | 0.5% |
| 12.3 | 49 | 0.5% |
| 19.8 | 48 | 0.5% |
| 16.3 | 48 | 0.5% |
| 14.6 | 47 | 0.5% |
| Other values (426) | 8488 | |
| (Missing) | 480 | 5.1% |
| Value | Count | Frequency (%) |
| -1.9 | 1 | |
| -1.4 | 1 | |
| -1.3 | 2 | |
| -1.2 | 1 | |
| -1.1 | 1 | |
| -0.6 | 2 | |
| -0.5 | 1 | |
| -0.3 | 1 | |
| -0.2 | 1 | |
| -0.1 | 2 |
| Value | Count | Frequency (%) |
| 44.6 | 1 | < 0.1% |
| 44.3 | 1 | < 0.1% |
| 43.4 | 1 | < 0.1% |
| 43.1 | 1 | < 0.1% |
| 42.8 | 3 | |
| 42.7 | 1 | < 0.1% |
| 42.6 | 1 | < 0.1% |
| 42.5 | 1 | < 0.1% |
| 42.2 | 2 | |
| 42 | 2 |
RH
| Distinct | 753 |
|---|---|
| Distinct (%) | 8.4% |
| Missing | 480 |
| Missing (%) | 5.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49.23420087 |
| Minimum | 9.2 |
|---|---|
| Maximum | 88.7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 74.1 KiB |
Quantile statistics
| Minimum | 9.2 |
|---|---|
| 5-th percentile | 20.3 |
| Q1 | 35.8 |
| median | 49.6 |
| Q3 | 62.5 |
| 95-th percentile | 77.9 |
| Maximum | 88.7 |
| Range | 79.5 |
| Interquartile range (IQR) | 26.7 |
Descriptive statistics
| Standard deviation | 17.31689246 |
|---|---|
| Coefficient of variation (CV) | 0.3517248611 |
| Kurtosis | -0.8183745211 |
| Mean | 49.23420087 |
| Median Absolute Deviation (MAD) | 13.3 |
| Skewness | -0.0379280099 |
| Sum | 442664.7 |
| Variance | 299.8747645 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 53.1 | 31 | 0.3% |
| 47.8 | 30 | 0.3% |
| 57.9 | 30 | 0.3% |
| 45.9 | 27 | 0.3% |
| 60.8 | 27 | 0.3% |
| 50.8 | 26 | 0.3% |
| 57.6 | 26 | 0.3% |
| 47.6 | 26 | 0.3% |
| 49.8 | 26 | 0.3% |
| 50.9 | 26 | 0.3% |
| Other values (743) | 8716 | |
| (Missing) | 480 | 5.1% |
| Value | Count | Frequency (%) |
| 9.2 | 2 | |
| 9.3 | 1 | |
| 9.6 | 1 | |
| 9.8 | 1 | |
| 9.9 | 2 | |
| 10 | 2 | |
| 10.2 | 1 | |
| 10.4 | 1 | |
| 10.7 | 1 | |
| 10.9 | 1 |
| Value | Count | Frequency (%) |
| 88.7 | 1 | < 0.1% |
| 87.2 | 1 | < 0.1% |
| 87.1 | 1 | < 0.1% |
| 87 | 1 | < 0.1% |
| 86.6 | 2 | |
| 86.5 | 2 | |
| 86 | 1 | < 0.1% |
| 85.7 | 3 | |
| 85.6 | 1 | < 0.1% |
| 85.5 | 1 | < 0.1% |
AH
| Distinct | 6683 |
|---|---|
| Distinct (%) | 74.3% |
| Missing | 480 |
| Missing (%) | 5.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.025530275 |
| Minimum | 0.1847 |
|---|---|
| Maximum | 2.231 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 74.1 KiB |
Quantile statistics
| Minimum | 0.1847 |
|---|---|
| 5-th percentile | 0.40085 |
| Q1 | 0.7368 |
| median | 0.9954 |
| Q3 | 1.3137 |
| 95-th percentile | 1.7256 |
| Maximum | 2.231 |
| Range | 2.0463 |
| Interquartile range (IQR) | 0.5769 |
Descriptive statistics
| Standard deviation | 0.403812606 |
|---|---|
| Coefficient of variation (CV) | 0.3937598099 |
| Kurtosis | -0.5600978405 |
| Mean | 1.025530275 |
| Median Absolute Deviation (MAD) | 0.2861 |
| Skewness | 0.2513877555 |
| Sum | 9220.5427 |
| Variance | 0.1630646208 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.1199 | 6 | 0.1% |
| 0.8394 | 6 | 0.1% |
| 0.7487 | 6 | 0.1% |
| 0.9684 | 6 | 0.1% |
| 0.9722 | 6 | 0.1% |
| 0.8736 | 5 | 0.1% |
| 0.9271 | 5 | 0.1% |
| 0.6686 | 5 | 0.1% |
| 0.8325 | 5 | 0.1% |
| 1.0594 | 5 | 0.1% |
| Other values (6673) | 8936 | |
| (Missing) | 480 | 5.1% |
| Value | Count | Frequency (%) |
| 0.1847 | 1 | |
| 0.1862 | 1 | |
| 0.191 | 1 | |
| 0.1975 | 1 | |
| 0.1988 | 1 | |
| 0.2029 | 1 | |
| 0.2031 | 1 | |
| 0.2062 | 1 | |
| 0.2086 | 1 | |
| 0.2157 | 1 |

| Value | Count | Frequency (%) |
|---|---|---|
| 2.231 | 1 | |
| 2.1806 | 1 | |
| 2.1766 | 1 | |
| 2.1719 | 1 | |
| 2.1395 | 1 | |
| 2.1362 | 1 | |
| 2.1247 | 1 | |
| 2.1195 | 1 | |
| 2.117 | 1 | |
| 2.1164 | 1 |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, which implies that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
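The calculation described above can be sketched in plain Python. This is a minimal illustration, not the report generator's implementation; the function name `pearson_r` is hypothetical, and the sample values are T and RH from the first five rows of the dataset shown in this report.

```python
import math


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/n factors of the covariance and the standard deviations cancel,
    # so plain sums of deviations are enough.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    # Note: a constant input gives a zero denominator (ZeroDivisionError).
    return cov / math.sqrt(ss_x * ss_y)


# T and RH from the first five rows of the dataset.
t = [13.6, 13.3, 11.9, 11.0, 11.2]
rh = [48.9, 47.7, 54.0, 60.0, 59.6]

r = pearson_r(t, rh)

# Shifting and (positively) rescaling one variable leaves r unchanged.
t_rescaled = [2.0 * a + 10.0 for a in t]
r_rescaled = pearson_r(t_rescaled, rh)

print(r, r_rescaled)
```

On these five rows the coefficient is strongly negative, and the rescaled input yields the same value up to floating-point error.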
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of the standard deviations of those rank variables.
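A minimal sketch of this rank-based calculation follows. The helper names (`average_ranks`, `pearson_r`, `spearman_rho`) are hypothetical, ties are assumed to receive the average of their positions, and the sample data are made up for illustration.

```python
import math


def average_ranks(values):
    """Rank values from 1 to n; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson_r(x, y):
    """Covariance divided by the product of the standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(ss_x * ss_y)


def spearman_rho(x, y):
    """Pearson's r applied to the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))


# Made-up data: y is a nonlinear but strictly increasing function of x.
x = [1, 2, 3, 4, 5]
y = [a ** 3 for a in x]

rho = spearman_rho(x, y)
r = pearson_r(x, y)
print(rho, r)
```

Because the relationship is monotonic but not linear, ρ reaches 1 while Pearson's r stays below 1.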
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
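The pair-counting definition above corresponds to the variant usually called τ-a, sketched below. The function name `kendall_tau_a` is hypothetical and the data are made up. Tied pairs count as neither concordant nor discordant here, whereas library implementations often apply a tie correction (τ-b), so results may differ when ties are present.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs, with no tie correction."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1   # both variables move in the same direction
        elif direction < 0:
            discordant += 1   # the variables move in opposite directions
        # direction == 0 means a tie in x or y; the pair is counted as neither
    n = len(x)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# Made-up data: two concordant pairs and one discordant pair.
tau = kendall_tau_a([1, 2, 3], [1, 3, 2])
print(tau)
```

This direct version compares every pair, so its cost grows quadratically with the number of observations.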
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal, and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
First rows
| | Date | Time | CO(GT) | PT08.S1(CO) | NMHC(GT) | C6H6(GT) | PT08.S2(NMHC) | NOx(GT) | PT08.S3(NOx) | NO2(GT) | PT08.S4(NO2) | PT08.S5(O3) | T | RH | AH | Unnamed: 15 | Unnamed: 16 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 10/03/2004 | 18.00.00 | 2.6 | 1360.0 | 150.0 | 11.9 | 1046.0 | 166.0 | 1056.0 | 113.0 | 1692.0 | 1268.0 | 13.6 | 48.9 | 0.7578 | NaN | NaN |
| 1 | 10/03/2004 | 19.00.00 | 2.0 | 1292.0 | 112.0 | 9.4 | 955.0 | 103.0 | 1174.0 | 92.0 | 1559.0 | 972.0 | 13.3 | 47.7 | 0.7255 | NaN | NaN |
| 2 | 10/03/2004 | 20.00.00 | 2.2 | 1402.0 | 88.0 | 9.0 | 939.0 | 131.0 | 1140.0 | 114.0 | 1555.0 | 1074.0 | 11.9 | 54.0 | 0.7502 | NaN | NaN |
| 3 | 10/03/2004 | 21.00.00 | 2.2 | 1376.0 | 80.0 | 9.2 | 948.0 | 172.0 | 1092.0 | 122.0 | 1584.0 | 1203.0 | 11.0 | 60.0 | 0.7867 | NaN | NaN |
| 4 | 10/03/2004 | 22.00.00 | 1.6 | 1272.0 | 51.0 | 6.5 | 836.0 | 131.0 | 1205.0 | 116.0 | 1490.0 | 1110.0 | 11.2 | 59.6 | 0.7888 | NaN | NaN |
| 5 | 10/03/2004 | 23.00.00 | 1.2 | 1197.0 | 38.0 | 4.7 | 750.0 | 89.0 | 1337.0 | 96.0 | 1393.0 | 949.0 | 11.2 | 59.2 | 0.7848 | NaN | NaN |
| 6 | 11/03/2004 | 00.00.00 | 1.2 | 1185.0 | 31.0 | 3.6 | 690.0 | 62.0 | 1462.0 | 77.0 | 1333.0 | 733.0 | 11.3 | 56.8 | 0.7603 | NaN | NaN |
| 7 | 11/03/2004 | 01.00.00 | 1.0 | 1136.0 | 31.0 | 3.3 | 672.0 | 62.0 | 1453.0 | 76.0 | 1333.0 | 730.0 | 10.7 | 60.0 | 0.7702 | NaN | NaN |
| 8 | 11/03/2004 | 02.00.00 | 0.9 | 1094.0 | 24.0 | 2.3 | 609.0 | 45.0 | 1579.0 | 60.0 | 1276.0 | 620.0 | 10.7 | 59.7 | 0.7648 | NaN | NaN |
| 9 | 11/03/2004 | 03.00.00 | 0.6 | 1010.0 | 19.0 | 1.7 | 561.0 | NaN | 1705.0 | NaN | 1235.0 | 501.0 | 10.3 | 60.2 | 0.7517 | NaN | NaN |
Last rows
| | Date | Time | CO(GT) | PT08.S1(CO) | NMHC(GT) | C6H6(GT) | PT08.S2(NMHC) | NOx(GT) | PT08.S3(NOx) | NO2(GT) | PT08.S4(NO2) | PT08.S5(O3) | T | RH | AH | Unnamed: 15 | Unnamed: 16 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 9461 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9462 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9463 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9464 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9465 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9466 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9467 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9468 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9469 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9470 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |